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Abstract 

In this paper, a downlink communication system, in which a Base Station (BS) equipped with 
M antennas communicates with N users each equipped with K receive antennas, is considered. An 
efficient suboptimum algorithm is proposed for selecting a set of users in order to maximize the sum- 
rate throughput of the system. For the asymptotic case when N tends to infinity, the necessary and 
sufficient conditions in order to achieve the maximum sum-rate throughput, such that the difference 
between the achievable sum-rate and the maximum value approaches zero, is derived. The complexity 
of our algorithm is investigated in terms of the required amount of feedback from the users to the base 
station, as well as the number of searches required for selecting the users. It is shown that the proposed 
method is capable of achieving a large portion of the sum-rate capacity, with a very low complexity. 
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I. Introduction 

Multiple-input multiple-output (MIMO) systems have proved their ability to achieve high 
bit rates on a scattering wireless network [1]. In a MIMO broadcast channel, the base station 
equipped with multiple antennas communicates with several multiple-antenna users. Recently, 
there has been a lot of interest in characterizing the capacity region of this channel [2], [3], 
[4], [5]. In [2]- [4], it has been shown that the sum-rate capacity of MIMO broadcast channels 
can be achieved by applying dirty-paper coding (DPC) [6] at the transmitter. Practical schemes 
for approximate implementation of DPC are proposed in [7], [8], [9], [10], [11], [12]. However, 
achieving the theoretical limits promised by DPC faces many challenges. 

In a network with a large number of users, the base station can increase the throughput by 
selecting the best set of users to communicate with. This results in the so-called "multiuser 
diversity" gain [13], [14]. However, achieving the optimum multiuser diversity gain requires an 
exhaustive search over all possible combination of the users, which is not practical for large-scale 
networks. To overcome this problem, references [15] and [16] propose sub-optimum methods 
for user selection. These methods exploit the multiuser diversity gain, but are based on assuming 
DPC at the base station. 

To avoid the complexity of DPC, the simple precoding scheme of "zero-forcing beam-forming", 
which is also called "channel inversion", is considered by some authors [17], [18], [19], [20]. In 
these works, it is assumed that the users are equipped with a single antenna. Using zero-forcing 
beam-forming, the downlink channel with M transmit antennas is decomposed into N < M 
interference-free subchannels, serving N users. Unfortunately, in cases that the number of users 
is equal to the number of transmit antennas, this method does not offer a good performance [20]. 
However, the case of N > M is more common in practical networks. In this case, selecting 
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the best set of users improves the performance of this scheme significantly [5] , [21] (multiuser 
diversity gain). Due to the high complexity of selecting the best set, reference [22] proposes a 
suboptimum algorithm for user selection in order to maximize the sum-rate. This algorithm is 
based on using zero-forcing beam-forming at the transmitter. The complexity of this algorithm 
is shown to be 0(M 3 N). 

To achieve a good performance by using zero-forcing beam-forming, the selected sub-channels 
must have high gains and be nearly orthogonal to each other. As the number of users increases, 
it becomes easier to satisfy these requirements. However, the exhaustive search for selecting 
the best set of users is very complex. In [23], the authors propose a suboptimum algorithm for 
selecting such a set of users in a downlink environment with large number of single-antenna 
users. This algorithm is similar to the greedy algorithm proposed in [15], with the difference in 
using an orthogonality threshold for selecting the users in each step. As a result, the channel 
vectors of the selected users become nearly orthogonal to each other with considerable gains. 
It has been shown that using this algorithm, the optimum sum-rate throughput of the system 
is asymptotically achieved as N — > oo. However, in their approach, the base station must have 
perfect Channel State Information (CSI) for all users. 

To avoid the huge amount of feedback required by providing perfect CSI to the base station, 
reference [24] proposes a downlink transmission scheme based on random beam-forming relying 
on partial CSI at the transmitter. In this scheme, the base station randomly constructs M 
orthogonal beams and transmits data to the users with the maximum Signal to Interference 
plus Noise Ratio (SINR) for each beam. Therefore, only the value of maximum SINR, and the 
index of the beam for which the maximum SINR is achieved, are fed back to the base station 
for each user. This significantly reduces the amount of feedback. Reference [24] shows that 
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when the number of users tends to infinity, the optimum sum-rate throughput can be achieved. 
However, for practical number of users, it does not perform well [23]. 

In this paper, we consider a MIMO-BC with large number of users and propose an efficient 
sub-optimum algorithm that assigns the coordinates of transmission space to different users in 
order to achieve the best performance in terms of the sum-rate throughput. It is assumed that the 
zero-forcing beam-forming is used at the base station as the precoding scheme. The algorithm 
starts by setting a threshold value. By applying Singular Value Decomposition (SVD) to all users' 
channel matrices, only the eigenvectors whose corresponding singular values are above the set 
threshold are considered. Then, among these candidate eigenvectors, the algorithm chooses a 
set of size M which are nearly orthogonal to each other. For the asymptotic case of N — > oo, 
we give the necessary and sufficient conditions for the threshold value in order to achieve the 
optimum sum-rate capacity, such that the difference between the sum-rates approaches zero. 
The proposed algorithm follows the same approach as that of [23], with a difference in the 
user selection strategy. The main advantage of our algorithm is that the coordinates are selected 
among the eigenvectors with singular values above a given threshold, and for the rest of the 
eigenvectors no information is sent to the base station. Therefore, the complexity of search and 
the amount of feedback required at the base station is significantly reduced. Indeed, we give the 
necessary and sufficient conditions for the threshold value in order to achieve the optimum sum- 
rate, such that the difference between the achievable sum-rate and the optimum value approaches 
zero. 

This paper is organized as follows. In section II, we introduce the system model, and describe 
the proposed algorithm. Sections III and IV are devoted to analyzing the performance, in terms 
of the sum-rate throughput, and the complexity of our proposed algorithm, respectively. Finally, 
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section V concludes the paper. 

Throughout this paper, the norm of the vectors are denoted by ||.||, the Hermitian operation is 
denoted by (.)*, and the determinant and the trace operations are denoted by det(.) and Tr(.), 
respectively. E{.} represents the expectation, notation "In" is used for the natural logarithm, 
and the rates are expressed in nats. RH(.) represents the right hand side of the equations. For 



any given functions f(N) and g(N), f(N) = 0(g(N)) is equivalent to limjv_ 



W) 

9(N) 



< 



oo, f(N) = o(g(N)) is equivalent to limjy^oo = 0, f(N) = Q(g(N)) is equivalent to 

lim^oo g$ > 0, /(AT) = uj(g(N)) is equivalent to lim^ g^} = oo, and f(N) = &(g(N)) 
is equivalent to lini/v^oo g^j- = c, where < c < oo. 



II. System Model 

In this work, a MIMO-BC in which a base station equipped with M antennas communicates 
with iV users, each equipped with K antennas, is considered. The channel between each user 
and the base station is modeled as a zero-mean circularly symmetric Gaussian matrix (Rayleigh 
fading). The received vector by user k can be written as 

y k = H k x + n k , (1) 

where x e C Mxl is the transmitted signal, // e £KxM ^ channel matrix from the 
transmitter to the kth user (assumed to be known at the receiver side), and n k e C Kxl ~ 
CAf(0, Ir) is the noise vector at this receiver. We assume that the transmitter has an average 
power constraint P, i.e. E {Tr (xx*)} < P. We consider a block fading model in which each 
H k is constant for the duration of a frame. The frame itself is assumed to be long enough to 
allow communication at rates close to the capacity. 
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The maximum achievable sum-rate capacity in MIMO-BC, denoted as TZ 0pt , is equal to [2] 



where Q n is the transmit covariance matrix of the nth user, and the expectation is taken over 
the channel matrices H 1 , ■ ■ ■ ,H N . The capacity achieving transmission strategy is shown to 
involve at least M, and at most M 2 data streams in total [25]. However, experimental results 
show that M data streams are adequate to achieve a significant portion of the capacity [15], 



As discussed earlier, the capacity achieving strategy in a downlink environment requires 
applying dirty-paper coding at the base station, which is not practical in many applications. For 
this reason, it is desirable to utilize a precoding scheme with less complexity. Among the known 
precoding schemes, zero-forcing beam-forming has received considerable attention, as it uses a 
simple structure of channel matrix inversion. This scheme results in having M interference-free 
sub-channels. Although this scheme does not yield a good performance for the case M = N 
[20] l , for the case of iV > M, which is more common in wireless networks, by selecting an 
appropriate set of dimensions, the corresponding performance is shown to be good [23], [22], 
[26]. In this work, using zero-forcing beam-forming at the base station, we propose an efficient 
algorithm to find M coordinates for data transmission, focusing on maximizing the sum-rate 
throughput. 



As mentioned earlier, to maximize the sum-rate using zero-forcing beam-forming, the selected 
eigenvectors must be nearly orthogonal to each other, and their corresponding singular values be 

'The result is derived for the case of single-antenna users 




[16]. 



III. Proposed Algorithm 
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sufficiently large. The measure of orthogonality between two Mxl vectors v and -0 is defined 
as. 

It is evident that the smaller is z(v, %]}), the more orthogonal will be v and 
Using Singular Value Decomposition (SVD), H k can be written as 

H k = U k A k V* k , (4) 

where A k is a K x M diagonal matrix containing the singular values of H k , U k and Vfc are 
K x K and MxM unitary matrices, respectively. Multiplying both sides of © by U* k j, where 
C/fcj is the jth column of U k , it is easy to show that 

r kj = g k jx + w kJ . (5) 

In the above equation, r k j = U* k ^y k , g k j = \J\{k)V* k j, where V k j is the jth column of 
V k and \/Xj(k) is the jth singular value of H k , corresponding to V k j, and w k j ~ CjV(0, 1) 
is AWGN. This equation suggests that for selecting the dimensions with high gains, the norm 
of the equivalent channel introduced by (|5]), namely g k which is equal to ^^/xJ{k), can be 
compared with a threshold. This threshold is set by the base station at the beginning of the 
transmission. Using such a threshold reduces the amount of feedback and the size of search 
space for selecting the coordinates. To satisfy the orthogonality criterion, the base station can 
perform an exhaustive search for finding the "most orthogonal set" 2 among the pre-selected 
eigenvectors. Due to the large complexity of exhaustive search, the coordinates can be chosen 
one by one. In other words, in each step the eigenvector which is the most orthogonal to the 

2 In general, the orthogonality of a set {/li}*^ can be measured by the orthogonality defect, defined as H*=iJ^«J t where 
H=[hT\...\h T M f. 
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previously selected coordinates, is selected. The first coordinate is chosen as the eigenvector with 
the maximum corresponding singular value. The steps of the algorithm are given in the following: 



Proposed Algorithm (Algorithm 1): 

1 . Using SVD, each user computes the eigenvectors and singular values of its channel matrix 
and sends back the singular values which are larger than a predetermined threshold t, along 
with their corresponding "right" eigenvectors, to the base station. The indices of these 
eigenvectors form the following set: 

So = {(k,j)\ \j(k)>t}. (6) 

2. Base station selects the index in So, corresponding to the maximum eigenvalue. Let us 
define this index as (si,di), i.e., the dith eigenvector of the sith user. 

3. Define 

St = S - {(si,^)}, 

and 

Tfc ,(l) = z(V slidl ,V kJ ), V(k,j)eS u (7) 



where z(.,.) is defined in ©. Note that as ||Vfcj-|| = HV^dJI = 1, z(V Sl: d 1 ,V 



\V* _, Vu ■ 

I v si,di v k ,3 



12 



4. For 2 < m < M, repeat the followings: 



{s m ,d m ) = arg min lk,j{rn - 1) 
(k,j)es m -i 

Sm S m — i {("5 m ,C? m )} 

7fej(m) = z(V Smidm ,V kJ ) +7*j(m- 1), V(k,j)eS m . (8) 
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In the above, 7^(771— 1) = YlT=i z {V Sit d i , V k j) is used as the measure of orthogonality between 
a candidate eigenvector V k ,j and the set of previously selected eigenvectors, {V" Si) ^}™^ . 
Since these eigenvectors are nearly orthogonal to each other by the algorithm, with a good 
approximation, 7fcj(m — 1) can be interpreted as the square magnitude of the projection of Vkj 
over the sub-space spanned by {V^^}^ . It is obvious that the smaller is this projection, the 
more orthogonal will be Vkj to this sub-space. The recursive structure of 7fcj(?n) facilitates its 
computation at each step of the algorithm. 

After selecting the dimensions, we construct the "selected coordinate matrix" as 

W= Kj0J 2 J-H0wJ T - (9) 
Using zero-forcing beam-forming, the transmitted vector x can be written as 

x = U x u, (10) 

where u = [-u Sljrfl , • • • ,u SM ,d M \ T is the information vector. Using © and (flOb . the received 
signal over the mth coordinate is equal to 

r s m ,d m s m ,d m Vs m 

= q Q 1 x + u>, w 

= 9s m ,d m ' H ~ lu + W Sm,d m 

= U s d +W S d ■ (11) 

It can be seen that by applying zero-forcing beam-forming, the downlink channel is decomposed 
to M interference-free sub-channels. 
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IV. Performance Analysis 

In this section, we examine the performance of our proposed algorithm in terms of the sum-rate 
throughput. First, we consider the asymptotic case of N — > oo. 

A. Asymptotic Analysis 

The sum-rate capacity of MIMO-BC has been shown to scale as M log log N, as N tends to 
infinity [24]. This implies that to achieve the optimum sum-rate, the singular values corresponding 
to the selected dimensions must behave like log N. In other words, the threshold value should 
scale as log AT. The following theorems indicates this fact with more details: 

Theorem 1 The necessary condition to achieve liniAr^oo Ho pt — TZp m p = is having 

t = \ogN + (M + K -2) log log iV - p(N), (12) 

where p(N) satisfies 

p(N) ~ o(logiV), 

and 

p(N) ~ loglogloglogjV + log[r(X)r(M)]+a;f 1 V 

Proof - We show that by violating any of the above conditions, the optimum sum-rate can not 
be achieved. 

The necessity of p(N) ~ o(logiV): 
It is sufficient to show that 

lim —L- = 1. (13) 

Af^oo log N 
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For this purpose, we consider the following cases: 



Case I; limAr^oo t — oo, liniTv^oo — - — < 1: The achievable sum-rate of the proposed method, 

log N 

denoted by 1Zp Iop , can be upper-bounded as 



ft Prop < E < max ^log(l + P^. 



Ef =1 Pi=P 1=1 



M 

di\\ 2 ) 



E <j max + P i X di(si)) } , (14) 



where g Sid . and A di (sj) are defined in ©. 

Since the optimum sum-rate is shown to be Mlog (-£ log iV + O(loglogiV)) [24], we have 
n 0p t-n Prop > Mlog log N + O(loglogiV)^ -li max ^log(l + PAd,^)) 1 



.1 f 1( »g ( log JV + O (log log N) 



E max £ ,o g (P,A J , W ) + log(l + 7 ^ M ) . 
The right hand side of the above equation can be written as follows: 

„ (a) ( (P/M) M \ 

KH(fT3| > min log M + M log (log N + O (log log N) ) 

M /n 

-^E{logA rfi (^)} + o(- 
= M log (log iV + 0(log log N)) - E { lo g ^ (*)} + 0(- 



(15) 



(b) 

> Mlog (log iV + 0(log log A0)-E<J max logA max (Jf fc ) 



k=l.- ,N 



-(M - l)E{logA/A >t} + 0[ - ) ■ (16) 
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where A max (A) is the maximum singular value of AA* , and A is a random variable, denoting 
an unordered eigenvalue of a K x K Wishart matrix, (a) comes from using the approximation 
log(l + x) ~ O(x), i < 1, noting that the solution to the maximization problem dT4T> satisfies 
Pi\ Si (di) 1, i = 1, • • • , M. (b) results from the fact that excluding the largest maximum 
singular value from the set of singular values, which are greater than t, reduces the expectation 
in the second line of (fT6b . In writing (6), we also used the fact that the eigenvectors and their 
corresponding singular values of a circularly symmetric Gaussian matrix are independent. The 
distribution of A, denoting as /(A) is derived in [1] as 

/W = ^ (M jl + i) ^ - K ^- K ^-^ (17) 
where Lf I ~ K (X) is the associated Laguerre polynomial of order k [27]. Using the above equation, 
it is easy to show that 

J t °° log Xf(X)d\ 



E {log A/A > t} 



log* 



1 - F(t) 

ir ^X 



1 - F(t) 



~ log t + I \ I . (18) 



Indeed, we can write 

E jjnax^logA max (if fc )} < E |^ max^log ||//,|| 2 | , (19) 

where ||A|| 2 denotes the Frobenius norm of matrix A. In [24], it has been shown that with 
probability one, 

max \\H k \\ 2 ~ logiV + O(loglogA^). 

fe=l,— ,N 

Therefore, 

e( max logA max (^)l < log (logiV + O(loglogiV)) (20) 
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Combining (HSl), (IT8b . and (|20b . we get 

Ko P ,-K P „ p > (M-l)log^ + o(!^) +0 (I). (21) 

Consequently, for lim^oo t = oo and lim^oo * < 1, lim^oo 7^ 0p t - 7?Pro P 7^ 0. 

log TV 



Case II; liniAr^oot = c, where c is a constant: In this case, ([To! can be written as 

/ p \ M 

n 0pt -n Prop > Mlogf — logiV + 0(loglogiV)J -^E{log(l + PA rfi ( Si ))} 

^ ' i=l 

> Mlog f ^ log iV + O (log log iV) ] -ijlogfl+P max A max (J/ fc 



-(M- l)E{log(l + PA)/A > t}. (22) 



Similar to (l20b . it is easy to see that 



E|log^l + P fc ma X7v A max (/f fc )J| < logP + logQogiV + O(loglogiV)). (23) 
Indeed, since E{log(l + PA)} < oo, we have E{log(l + PA) /A > t} ~ 0(1). Hence, 

^opt-^ Prop > (M-l)loglogJV + 0(l). (24) 
As a result, limjv^oo 7^o P t — T^Prop 7^ 0. This completes the proof of 

lim — < 1 lim 1Z 0p t - 1Z Pr:op ^ 0. 

N^oo log I\ N^oo 

Case III; limjv-^oc - — - — > 1: Let us define as the probability that the maximum singular 

log N 

value of a randomly chosen user k is greater than t. In [16], it is shown that for a K x M matrix 
A, whose entries are i.i.d Gaussian with zero mean and variance one, we have 

f M+K-2 (_ f \ 

Prob{A max (A) > t} ~ r(M) e r X g } t} [1 + (r 1 )] . (25) 



Therefore, 



» = '"S" [1+Q(r ' )] ' (26) 
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which is independent of k, and we denote it with p. We define L as the number of users 
whose maximum singular values are greater than t. Since L is a binomial random variable with 
parameter p, E{L} = Np. 

Using [28], Theorem 1, we can write 



ftopt-ft Prop > (i-pf^-ny 



>NCSI\ 



(27) 



where TZi = E < max Q n log det ( I M + J2n=i H* n Q n H n ) A >, A is the event that L = 

I ETr(Q n )=P V " ' ) 

0, and TZ^ CSI stands for the sum-rate of MIMO-BC when no CSI is available at the base station, 



conditioned on A. In [29], it has been shown that 



P 



I+ M HkH « 



(28) 



ft NCSI = E Hfc O)nd(<t 

Since linijv-Kx> > 1, using d26l) . it can be easily shown that Np — ► 0. As a result, with a 
similar approach as in [28], we have 



7^ CSI = E^^aogdet 



I+ M HkH « 



A 



0(1). 



(29) 



Indeed, we can write 



Tlx > E{log(l + Pfl 

max ) "max 

<t} 



> E {log(l + P# max ) |fl max < t, # max > log N} Prob{# max > log iV|# max < t} 



> log(l + PlogiV)tf, 



(30) 



where 6» max = max fc \ max (H k ), and = Prob{6> max > logiV|0 max < £}. Using <|26]>, i? can be 
written as follows: 



r) 



1 



t M + K - 2 e- t {l+Q(t- 1 )) 
F(M)T(K) 



N 



1 



[logjV] M + y - 2 (l+Q([log N}- 1 )) 
NT(M)T(K) 



N 



t M + K - 2 e- t (l+Q(t- 1 )) 
T(M)T(K) 



N 



(3D 
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Since Hhitv^oo 1^77 > 1> ^ can ^ e snown that 1? ~ 1 — o(^). Substituting $ in (l30l> . yields 

ft! > log(l + PlogiV)^l- (l)j. (32) 
Using the above equation and d2^l. the right hand side of (|2"71) can be lower-bounded as, 

KH(|22l) > (l-p) JV [loglogiV + 0(l)] 

~ e-" p(1+0(p)) [loglogJV + 0(l)] 

~ log log iV. (33) 

The last line in the above equation follows from limjv^oc, > 1, which incurs Np — > 0. As 
a result, fto P t — ftprop 7^ 0. This completes the proof for the necessity of p(N) ~ o(log AT). 

1 



77ze necessity of p(N) = log log log log iV + log[T(K)T(M)] + uo . , , , x _ 

\ log log log N 

Let p(N) = log log log log iV + \og[T(M)T(K)} + a(N). Suppose that 

p(N) no log log log log N + log[r(K)r(M)] + u ( - ) , (34) 

\ log log log N J 

which incurs a(N) ~ O ^ logl0 g logJv j, or a(N) < 0. Using (EH), we have 

ftopt-ftprop > (1-J))^1-^5 CSI ]. (35) 

Similar to (l2"9t and (l3*2l . under the assumption of (l3*4"l) . it can be shown that 

^ > log(l + PlogiV)(l-o(-^)j, 
ft^SI _ (36) 
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Using the above equations and (l26l) . we can write 

/ +M+K-2 (_ f \ \N 

n 0pt -n Prop > \i- r(M)r g } } [i + oir^j [iogio g iv + o(i)] 



p(N) 



NT(M)T(K) 



1+0 



log log A" 
logN 



N 



[loglogiV + 0(l)] 



~ ex p{-fp^f^jj[l + o(l)][loglogiV + 0(l)] 

~ exp{-e ff(7V) logloglogiV} [log log N + 0(1)] [1 + o(l)] 

~ Mexp{[l - e^] log log log [1 + o(l)] . (37) 

Under the assumption of (l34b . in the case of <r(N) = O ( : ), i.e., 

\ log log log N J 

lim a(N) log log log iV = c < oo, 

N— >oo 

using (l37l) . we have 

^opt-^ Prop > exp{[- ( r(iV)+0((T 2 (iV))]logloglogiV}[l + o(l)] 

~ exp{-<r(iV)logloglogiV}[l + o(l)]. (38) 

Hence, 

lim 7e 0p t - Tiprop > e~ c 

^ 0. (39) 

Also, in the case of a(N) < 0, using (1371) . we have 

lim T^opt - 7£p r0 p > 1 

N^oo 

0. (40) 

This completes the proof for the necessity of p(N) = log log log log N + \og[T(K)T(M)] + 
1 



log log log A" 
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Theorem 2 The sufficient condition to achieve limjv^oo T^opt — TZp Top = is having 

t = \ogN + (M + K -2) log log iV - p(N), (41) 

where p(N) satisfies 

p(N) ~ o(logiV), 

and 

p(iV) ~ log log log log iV + uo (1) . 
Proof - First, we state and prove the following lemmas: 

Lemma 1- Assuming K > 1, define VLj as the probability of existing at least one user from 
which J eigenvectors (J > 1) are selected in Algorithm 1. Setting t = logiV + (M + K — 
2) log log iV — p(N), in which p(N) satisfies the conditions of Theorem 2, we have 

/ o(\o g N)\ 

Vj~0 {-jpzr) ■ (42) 
Proof- Consider the following event 3 : 

A k = {\i(k)>t, i = !,■■■ J, Xi(k)<t, i = J+l,--- ,K}. (43) 

3 We have assumed that the singular values are in the decreasing order, i.e., Ai > A2 > • • • > \k 
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We have 



l-tfJI 2 



Tr{H k H k } 



K 
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i=l 



(44) 



Since t = logiV + o(logiV), we can write 



Prob{A fe } < Prob{||fr fe || 2 > J log N + o(log N)} , 



(45) 



As ||i?/t|| 2 has a chi-square distribution with 2MK degrees of freedom [30], the right hand side 
of d45t can be written as 

x MK-l eX p(_ x ) 



Prob{||// fe || 2 > JlogiV + o(logiV)} 



J log N+o(log N) 
MK-1 



T{MK) 



-dx 



[JhgN + o(\ogN)r ^ JloKN+o(loKN) 



m=0 



([JlogiV]*^" 1 + o{[logN] MK - 1 )) e°^ N ^ 



N J (MK - 1)! 
[log A/]A^-i e °( logiV ) 



N J 



[l + o(l)], 



(46) 



where \&j 



f. Using (l45l) . and (|46T) . we can write Hj as 



(mk-i)\ 

nj = 1 - J] (1 - Prob{A k }) 

[log N]MK-l e oQogN) 



N 



k=l 



< 1 



N J 



N 



1 — exp < N log 



1-*J [ - g J ^ [l + o(l)] 



1 - exp ^ g J Ar ^ [1 + o(l)] 



NJ- 1 







NJ- 1 



(47) 
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As a result, linijv^oo ilj = 0, for J > 1. This implies that as N — > oo, with probability one, 
at most one eigenvector for each user is likely to be selected by this algorithm. This eigenvector 
corresponds to the maximum singular value of that user. 



Lemma 2- Lett = log N+(M+K— 2) log log iV— p(N), in which p{N) satisfies the conditions 
of Theorem 2, and L be the number of users being selected in the first step of Algorithm 1. 
Then, as N — > oo, with probability one 

e P(N) 

L ~rmfm ll+ ° mi (48) 

Proof- Using (l26l) . the probability of a randomly chosen user k being pre-selected in the first 
step of Algorithm 1 can be calculated as, 

p = Prob{A max (if fc ) > t} 



T(M)T(K) ( 1 + ° (rl) ) 



e P(N) 



NY(M)Y(K) 
log log log Ne q{N) 



[1 + o(l)] , (49) 



NT(M)T(K) 

where q(N) = p(N) — log log log log N. Consider the following probability: 

f = Prob {Np(l — e) < L < Np(l + e)} , (50) 
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where e = y / 2T(M)T(K)e q( ± ' . Note that since q(N) = we have lmiTv^oo e = 0. £ can 

be computed as 



[Np(l+e)\ 

I 



t = E ( / H 1 -?) 

/=fiVp(l-e)l 



7V-Z 



| ^ iv>-iv>(l-e) \ _ ( Np(l + e)-Np 



1-2Q 



2y/T^ / iV> 2 

1 r— — exp -— . (51) 



/^v^Vpe V 2(1 -p) 
Substituting p from (|49|) . and having e 2 = 2r(M)r(A')e^ n , we have 

yi \ \ AT exp {-logloglogiVe^l + o(l)] j (52) 
Vlog log log AT / L J 

Thus, limjv- > oo£ = 1. Finally, using (l49l) and (l52l . with probability one we have 

L ~ Np(l + 0(e)) 

e P(N) 

~ f(M)f(7o |1 + o(1)l ' <53) 



Since p(A r ) = o(logAQ, from Lemma 2, it is evident that lim7v->oo = 0. Therefore, only a 
small fraction of users are pre-selected. This results in reducing the amount of feedback sent to 
the base station. 

As shown in Lemma 1, in the asymptotic case of A" — > oo, at most one eigenvector from 
each user is likely to be selected. This eigenvector corresponds to the maximum singular value 
of that user's channel matrix, and is denoted by V iimax . Hence, for the sake of simplicity of 
notation, we define the measure of orthogonality between the users i and j, denoted by 0(i,j), 
as the orthogonality measure between Vj )max and V j imax , defined in © as z (Vj )max , Vj imax ). 
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In other words, 

0(iJ) = \V* hmax V jjmax \ 2 . (54) 
Lemma 3- The probability density function of 0(i,j) defined in A54\l can be computed from 

p 0{l ,,)(z) = (M-l)(l-z) M - 2 . (55) 

Proof- In Appendix A. 

Definition!- A set S = {h i } 1 ^ =1 , in which hi G C lxM , is called e-orthogonal if we have 
z(hi, hj) < e, for every hi ^ hj G S. 

Lemma 4- Let t = log iV + (M + K — 2) log log iV — p(N), where p(N) satisfies the conditions 
of Theorem 2. Then, as N — > oo, the selected coordinates by Algorithm 1 construct an e(N)- 
orthogonal set, with probability one, where e(N) = e -2 ^, and q(N) = p(N) — log log log log N. 

Proof- After selecting the first user, Sx, with largest maximum singular value, the user which 
is most orthogonal to si is selected. In other words, 

s 2 = arg min 0(1, si), (56) 

where <Si is defined in Q. First, we show that the users si and $2 are with probability one e(N)- 
orthogonal to each other, or equivalently, 0(s 2 ,si) < e(N). To do this, consider the following 
probability: 

p = Prob{0(s 2 , Sl ) < e{N)}. (57) 
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Using (l55l) . this probability can be written as 

fi = Prob |minC(/,s 1 ) < e(N)\ 
= l-(Pwb{0(l,s 1 )>e(N)}) L - 1 
= 1- ( [ (M — 1)(1 — z) M ~ 2 dz 

\Je(N) 

= i_[i_ e(iV )](^)(^) 
= l-exp{-(L-l)(M-l)log[l-e(JV)]} 

= l-exp{-(L-l)(M-l) [e(N) + 0{e 2 (N))}}. (58) 

Defining the event V = {Np(l - e) < L < Np(l + e)}, with p and e defined in (03) and (l50l) . 
and using (152b . a lower bound for /x is found as, 

fi > Prob{£>} [l-exp{-(JVp(l-e)-l)(M-l) [e(A^) + O (e 2 (A^))] }] 

( -q(N) \ "I 

e 4 \ r 9 (jv) "1 

exp j - log log log iVe— l + o(l) x 
V log log logJv y I J 

/ logloglogA^e <M m (iV> 11 
1 " GXP { T(M-l)T(K) [1 + 0(1)] /J • (59) 

Since g(A r ) ~ ^(1), the above probability approaches one as N — >• oo. Therefore, with proba- 
bility one users s 1 and s 2 are e(iV) -orthogonal to each other. 

Now, assume that m users, which construct an e(A r )-orthogonal set A m , are selected up to 
the mth step of Algorithm 1. We show that the selected user in the (m + l)th step of this 
algorithm, s m +i, is such that with probability one, A m+ i = A m + {s m +i} is e(N) -orthogonal, or 
equivalently, s m+ i is e(N) -orthogonal to all users in A m . To this end, we define the following 
probability: 

u k>m = Prob{C(si, k) < a, 0(s 2 , k) < a, ■ ■ ■ , 0(s m , k) < a}, (60) 
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where a = z/ fc m is the probability that a randomly selected user k is a-orthogonal to all 
users in A m . This probability can be written as 

m 

u k>m = Prob{C(si, k) < a} JJ^, (61) 

8=2 

where = Prob {0(sj, A;) < a | 0(s 1; fc) < a, • • • , 0(sj_i, k) < a} . From (l55l) . the first term 
in the right hand side of the above equation can be written as 

pa 

Prob {0(si, k) <a} = / (M — 1)(1 — z) M - 2 dz 

Jo 

= l_(l_ a )M-l 

~ (M — l)a + 0(a 2 ). (62) 

In Appendix B, it has been proved that 

m~ (M -t)a + 0(a 3/2 ). (63) 
Hence, using (l6lT) . (l62l . and (l63l) . we can write 

m 

[(M - l)a + 0(a 2 )] [(M - i)a + 0(a 3/2 ^ 

i=2 



x^^ M) ^ m + 0(a m+l ' 2 ) 
T(M - m) v ; 

[{Xf) [[e(N)] m + O ([e(iV)]( m+1 / 2 ))] (64) 



T(M - m)M" 

Now, we define uj m as the probability of existing at least one user a-orthogonal to the users in 
the set A m - Noting that u kjTn is the same for all k, we obtain, 

L—m 

Um = 1 ~ n ^ ~ 

k=l 

= l-exp{(L-m)log(l-^ m )} 

= l-exp{(L-m) [-Vk, m + 0(u£ t J]}. (65) 
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Similar to (l59t . we can compute uo m as, 



1-0 



e * 



Vlog log log N 



exp | — log log log A^e 9(3 ' [1 + o(l)]| 



{M-m)q(N) 

log log log m 
1 - exp { —TTJ^TTT^i 1 + 



(66) 



T(M - m)M m T(K) 

Since m < M — 1, it follows that Khitv-^oo = 1- In other words, as A" tends to infinity, with 

probability one there exists at least one user u rn+1 , a-orthogonal to all users in A m . 

Consider user s m+ i which is selected in the (m + l)th step of Algorithm 1. Obviously, we have 

m m 

220(s m+ i,Sj) < 220(u m+1 ,Sj) 

< ma 

me(N) 
M 

< <N). 



(67) 



Knowing the fact that 0(s m+ i, Sj) > 0, for j — 1, • • -m, we can write 

0(s m+1 ,Sj) < e(N), j = l,---m 

which means that with probability one, s m+ i is e(N) -orthogonal to the users in the set A m , and 
consequently, A m +i is an e(A r )-orthogonal set. 

Let us define X m as the event that the set A m is e(A r ) -orthogonal. We can write 



M 



Prob^A/} = Prob{^ 2 } H ^tob{X m \X m ^}. 



(68) 



m=3 
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From (l59l) and d66b . the above probability is lower-bounded as 



M-l 



Prob{X M } > Yl 



> 



m=2 



1-0 



e 4 



v/log log log iV 



cxp <^ — log log log Ne~^~ [1 + o(l)] f 



logloglogiVe'" M 9(iY> 
1 - exp { ^-^j — T ^ [1 + o(l)J 



T(M -l)T(K) 



x 



M-l 

n 

m=2 



(M-m)q(iV) 

log log log Ne m 
1 - exp { — ZTTT^FTlT^t 1 + o{l)\ 



T(M -m)M m T(K) 



~ 1 — exp 



log log log A^e m 



M-l 



(69) 



T(M - m)M m T(K) 

Therefore, limA^oo ProbjA^/} = 1. In other words, the selected coordinates by Algorithm 1, 
with probability one, construct an e(N) -orthogonal set as A" tends to infinity, which completes 
the proof of Lemma 4. 



As mentioned earlier, after selecting the coordinates, the "selected coordinate matrix", 7i, is 
constructed using ©. By applying zero-forcing beam-forming, the information vector, u, is 
multiplied by 7i 1 to construct the transmitted signal as (ITOb . Using (fTTT) . we can write 



r = u + w, 



(70) 



where r = [r Sl)dl ,-- - ,r w J T , u = [u sudl ,--- } u Sm4m ] t , and w = [w sudl ,--- } w Sm4m } t . 
Having the power constraint P for x, the sum-rate capacity can be computed as [19], 



M 



ftprop = E K t max + P m) \, lm= [(WW)" 1 ] , 



(71) 



spM p <p m=l 



where [A]^ denotes the entry of matrix A in the ith row and the jth column. The optimal P m 's 
in (T7TT) can be obtained by "water-filling". Here, we assume that P m 's are all equal (uniform 
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power allocation). Thus, 



Tr {[WhCf 1 ] 



(72) 



Consequently, 



(73) 



Tr {{WHY 1 } 

where 7£p[. op stands for the sum-rate achieving by the proposed method, when the power is 
uniformly allocated among the coordinates. 

Having defined X M in d6"Sl) and using do^t . the above equation can be written as follows: 



n 



u 

Prop 



E n < Mlog I 1 



P 



K-j-c s M log | 1 + 



> E n < Mlog I 1 



~ I 1 — exp 



E^Mlog I 1 + 



Tr {{WHY 1 } 
P 

{[WHY 1 } 

Tr {[WHY 1 } 

log log log Ne^M~ 
T(M — m)M m Y(K) 

P 



^J(l-Prob{* M }) 
;r A/ jprob{;t A/ } 



M 



(74) 



Tr {[WHY } 
where X^ is the complement of Xm- 

From Algorithm 1, it is obvious that the corresponding singular values of the selected eigen- 
vectors are greater that t = \ogN + (M + K — 2) log log iV — p(N). However, the following 
lemma which is proved in Appendix C, states that the singular values of all selected dimensions, 
with probability one, can not exceed log N + (M + K — 1) log log N: 



Lemma 5- Let t = log iV + (M + K - 1) log log N. Then, 

r? = Prob{ max X max (H k ) > t ) = O ( — *— ) . (75) 
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As a result of this lemma, the singular values corresponding to the all selected dimensions 
can be expressed as log iV + o(logiV). 



To compute the conditional probability < M log ( 1 + 



X M \ , we define B 



WH* . Conditioned on X M , i.e., having e(N) -orthogonality among the selected dimensions, using 
©, and the results of Lemma 4 and Lemma 5, we can write 



&a = \\g a 



\ogN + f(N), 



(76) 



and 



IB 



'j i 



a/O (log A^) x O(logiV) xO(e(N)) 



0(e(N)hgN), 



(77) 



where f(N) ~ o(logiV). In Appendix D it has been shown that any diagonal element of B 1 
can be expressed as [log NY 1 + O \J£~]f), where 



W^max(^,e(iV)) -o(l). 
Having this, and using the fact that Tr { [WHY 1 } = Tr {®~ l }> we can write 



E^ Mlog 1 + 



P 



Tr {[WHY 1 } 



X 



M 



Eg < M log 1 + 



P 



Tr {B^ 1 } 



X 



(78) 



Mlog 1 + 



Mlog 
Mlog 



1 + 



M[\ogNY x + o(0) 
P 

MflogiV]- 1 [l + 0(h(N))] 



P 

M 



logN + 0(h(N) logiV) 



(79) 
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From (1741) and d79t . we have 



74L P > Mlog logiV + O(MiV) logiV)) - exp J_^^^-[i + (1)] 

(80) 

Since adaptive power allocation (using "water-filling") results in higher sum-rate than that of 
uniform power allocation, we have TZp TO p > 'fcpLp- Having the fact that [24] 

^ pt~Mlog^logAA + 0(loglogiV)J , (81) 

and using (I50l) . we have 

n 0p t-n Pmp < M\og(^logN + gi (N)J-Mlog(^logN + g 2 {N)J(l-g 3 (N)) 

& \ PlogN J S V PlogiV J 

Mg 3 (N) log logiV + g 2 (NYj , (82) 

where ^(iV) ~ O(loglogiV), g 2 (N) ~ O (/i(JV) logiV), and 

/■aa / log log log iVe 2 ^ \ / 1 \ 

^ (A ° ~ exp \- r(M-m)M-r W [1 + 0(1)] ) ~ Ibgbgjv J • (83) 

From (T78l) and (l83l) . and Using the approximation log(l + x) ~ x, for i C 1, and we can write 

7Jo Pl -7?P rop ~ M ( Ml9l( ^ (iV)1 ) + M 93 (N) log logiV + gl w) 

~ 0(1). (84) 

Consequently, 

lim 1Zo pt - TZ Prop = 0, (85) 

AT— >oo 

which completes the proof of Theorem 2. 
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Theorem 2 implies that using Algorithm 1, and applying zero-forcing beam-forming at the 
base station, the same performance as when the optimum user selection algorithm and optimum 
precoding scheme is utilized, can asymptotically be achieved. 

Remark 1- Although in the proof of Theorem 2, we showed that limTv-^oo T^opt ~ ^Pro P = 0, 
it is interesting to minimize the order of difference. 
Rewriting (I84L we get 

f loe: loe loe iVe 1/e(7V) 1 

n 0pt -n Prop ~ o (em + e*P {- ^ f * )Mmv{K) f 1 + o(i)] j o(iogiogiv), 

(86) 

where g(N) = max (h(N), lo g,y ), and h(N) is defined in CSJ- Hence, in order to minimize 
the order of difference, we must have h(N) = O ( '"^y ), which incurs e(N) = O ( lo g,y ) 
and /(JV) = O (log log iV). As a result, 

q(N) = —M \oge(N) 

= M log log N - M log log log iV + i/}(N), (87) 

where if)(N) is an arbitrary function with the condition limTv^oo 4>(N) = c > 0. Hence, using 
the definition of q(N) in Lemma 4, we can write 

t = log A^ + (K — 2) loglog A^ + M log log log iV — log log log log A" — 4>(N). (88) 

Also, to guarantee f(N) = 0(loglog AT), we must have 

t = logiV + 0(loglogiV), (89) 

which means ip(N) ~ 0(\og\ogN). Having these conditions on t, we can guarantee 7£. 0p t — 

/^prop ~ u y logN y 
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Remark 2- It is important to note that satisfying limjv- > oc 7Z 0pt — 7Zp rop = 0, is much more 
challenging than that of limjv^oo = 1. The following lemma, which is proved in Appendix 
E, clarifies this fact: 

Lemma 6- Suppose that in Algorithm 1, t — log N, and the coordinates are chosen randomly 
among the preselected eigenvectors. Then, 

lim ^£ = 1. (90) 

The above lemma states that to satisfy liniA^oo -^^f = 1, the orthogonality among the 
coordinates is not a necessary condition. 

B. Comparison with other Downlink Strategies 

In this section, we compare the performance of our proposed scheme with some other downlink 
strategies in terms of sum-rate capacity. To have a good measure for comparison, we give the 
following definition: 

Definition 2- For a MIMO-BC in which a base station, and average power constraint P 
communicating to N users, using strategy S, the multiplexing gain is defined as 4 

r s = lim SK ' \ (91) 

P^oo log P 

and the multiuser diversity gain is defined as 

n s (p,N) 

ds = Jim — : — : -, (92) 

n^oo r s log log N 

4 More precisely, as in [31], r is the maximum achievable multiplexing gain when diversity gain approaches zero. 
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where TZs(P,N) is the achievable sum-rate. 

Lemma 7- Using the proposed algorithm, and applying zero-forcing beam-forming, we can 
achieve r = M, and d = 1, which are the maximum achievable values in a MIMO-BC. 
Proof- Appendix F. 

1 ) Time Division Multiple Access (TDMA): In this scheme, the base station only serves one 
user in each time slot. Hence, to achieve the maximum sum-rate, the user which has the maximum 
single-user capacity should be served. Because of its simplicity, this strategy is widely used in 
the downlink of the cellular networks. The achievable sum-rate of this scheme can be written 

as 

"ft-TDMA = E < max max logdet [I K xK + H k Q k H* k ] > , (93) 

\ k Q k 

K Tr{Q k }=P ) 

where Q k is obtained by "water-filling". Using d9"Tb and d9"21 ). and the result of Lemma 1 in 
[32], the multiplexing gain and multiuser diversity gain for this scheme can be obtained as, 

r ^TDMA(P,iV) 

r TDMA = I™ : " 

P^oo log P 

P\{k) 



Ejmax^ErJi^log 



min(M, K) 



lim 

P^oo log P 

mm(M,K), (94) 



and, 



TI TD ma(P,N) 

iTDMA = 



n^oo min(M, K) log log 



(95) 
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Hence, this scheme achieves the full multiuser diversity gain, while achieving the full multiplex- 
ing gain only in the case of K > M. 

Although this method has been shown to be optimal for single-antenna broadcast channel 
(M = 1) [33], for the case of M > K > 1, as a result of losing the multiplexing gain, 
this method is no longer optimum 5 . 

From the proof of the Lemma 1 in [32], it can be observed that the upper and lower bounds for 
T^tdma have the same behavior asymptotically almost surely, when N — ► oo. In other words 6 , 

inogfl + ^maxA^JT^*)^ ~ K log ( 1 + ^-mzxTr (H k H* k/ 



K k mmv ft * ') \ K 2 k 

~ inog(l + -^logiV), (96) 

where H' k (K x K) is a truncated version of H k by omitting the M — K columns of H k . From 
(|9"6T). and having the fact that X min (H' k H' k *) < \ min (H k H k *), the following observations can 
be obtained: 

Observation 1- For the user which maximizes the single-user capacity in d93t . (I), all the 
eigenvalues should be of the same order. In other words, 

XjiHtH*) ~ ^ + O(loglogiV), j — 1, ■ ■ ■ , K. (97) 

As a result of this, HiH^ tends to the identity matrix. 

Observation 2- The user with maximum single-user capacity has the maximum A m i n , asymp- 
totically. 

For the case of K > M, similar to d9oT) . the asymptotic sum-rate capacity can be computed 

5 For the case of K > M, this scheme is not optimal either. This fact will be discussed in more details later. 
6 It is assumed that K < M. 
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as 

^tdma ~ MlogQ^logJVJ. (98) 

In this case, it can be easily shown that liniTv^oo ^tdma = \ j n other words, the optimum 
sum-rate can asymptotically be achieved. However, the selected dimensions by TDMA belong 
to the same user and have the asymptotic behavior of ^jp-, while in our proposed method the 
selected dimensions belong to different users with the asymptotic behvior of log N. Moreover, 
we have 

P . ,A . / P 



n 0pt -n TDMA ~ Mlog( 1 + — logiVj -Mlog^l + ^logiV 

~ M log M. (99) 

As can be observed from figure El this gap affects the performance significantly, especially when 
M is large. 



2) Random Selection: In this method, the base station randomly selects M users for trans- 
mission. This results in having fairness in the system. This strategy can also be regarded as 
Round-Robin scheduling algorithm, when the users are randomly divided into groups of size M, 
and the base station serves one group in each time slot. 

In Appendix G, it is shown that using multiple dimensions for transmission results in having 
multiplexing gain equal to M. However, because of random selection of the users, this scheme 
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does not provide multiuser diversity gain. More precisely, 

n RS (p,N) 



d RS = lim 



N—>oo M log log iV 



lim 



V.-A, S max 



ETr(Q m ) 



Qm log det [I M + Em=i H* m Q m H m ) \ 
(Q m )=P v J ) 



N^oo M log log N 

N->oc M log log N 

= 0. (100) 

As a result of lacking multiuser diversity gain, this scheme shows a weak performance especially 
for large number of users. (Figure 0) 

C. Simulation Results 

So far, we have shown that as N tends to infinity, our scheme achieves the optimum sum-rate 
which scales like Mlog (-^logiV). In this section, simulation results are provided to examine 
the performance of our proposed scheme in practical networks with finite number of users. 
Figure [l] shows the optimum threshold (computed by exhaustive search) as a function of the 
number of users for M = 2, K = 1, and M = 4, K = 1. These curves show that the optimum 
threshold for each N, lies between log iV — log log N, and log N. 

Figures El presents the plots of the corresponding sum-rate versus the number of users for 
different number of transmit and receive antennas. The Signal to Noise Ratio (SNR), which is 
equal to the transmitted power P, is fixed to 10 dB in all curves. For comparison, the plots of 
sum-rate when using TDMA and Random Selection algorithms, as well as the optimum scheme 
of dirty-paper coding are also given. For Random Selection algorithm, it is assumed that the 
optimum precoding scheme of dirty-paper coding is used. 
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Figure |3] depicts the plots of sum-rate capacity versus SNR (P), for M = 2, K = 1 and 
M = 4, K = 1. The number of users is fixed to 100 in both curves. It can be observed that the 
sum-rate achieving by the proposed scheme shows a linear increase with logP in high SNRs 
with the slope equal to M. This confirms achieving the multiplexing gain of M by the proposed 
scheme. 





Number ol Users 



Number of Users 



Fig. 1. Optimum threshold versus the number of users. 



V. Complexity Analysis 

A. Amount of Feedback 

As can be observed in the proposed algorithm, only the eigenvectors that belong to So, defined 
in ©, must be sent back to the base station, along with their corresponding singular values. For 
the asymptotic case of iV — > oo, from Lemma 2, we conclude that the cardinality of So scales 
as r (Myr(K) • Assuming that for each eigenvector and its singular value 2M real values must be 
fed back, the total number of real values required at the base station is asymptotically equal to 

r(M)r(x) • 
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Fig. 2. Sum-rate capacity versus the number of users, P = lOdB. 



From Theorem 1, we observe that to achieve the optimum sum-rate, i.e., lirriTv^oo T^opt — 
7^-Prop = 0, the following condition must be satisfied: 

p(N) ~ loglogloglogiV + log[r(K)r(M)]+^f i - \ - ). (101) 



log log log N 
As a result, 

A/" Prop ~ 2MlogloglogA^ + cj(l), (102) 
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5 10 15 20 25 30 

P[dB] 



Fig. 3. Sum-rate capacity versus transmit power, N = 100, K = 1. 

where A/p rop stands for the amount of feedback (in terms of the total number of real values 
required at the base station) in the proposed method. From the above equation, it follows that 
the minimum amount of feedback required to achieve the optimum performance is lower-bounded 
by log log log N, in the proposed algorithm. However, in [28], it has been shown that the same 
result holds for any other strategies. 

In order to guarantee limA^oo 7^ 0p t — ^Pro P = in the proposed scheme, using Theorem 2, 
the following condition must be satisfied: 

A/" Prop ~ uj (log log log N). (103) 

Note that the computation of jkj's in Algorithm 1 (eq. ®) can be performed in the mobile 
sides, which reduces the amount of feedback further. This idea is described in details as the 
following algorithm: 

Algorithm 2 (Modified version of Algorithm 1): 
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1. Set the thresholds t and (3. 



2. Define 



So = {(k,j)\ \j{k)>t}. 



For all (k, j) G «So, send \j(k) to the base station. 

3. Let (si, di) = arg max( fcj ) eiSo \j(k). Base station informs the user s x to feed back the 
eigenvector corresponding to its maximum singular value and after receiving it, sends these 
information to all the users in Sq — {(si, di)}. 

4. Define 7/tj(0) = for all (k,j) G Sq. For m = 1 to M — 1 the following steps are repeated: 



\ v l m ,d m v k,j\ 2 + lk,j(m - 1), for all (k,j) G S m . All users in S m feed back their 
corresponding ^ k j(m) to the base station. 
— Select (s m+ i, d m +i ) = arg min (fc j) 6Sm 7jfcj(m). Base station inform the user s m 
to feedback its d m th eigenvector, and after receiving, sends it to all users in 5 m — 



For the asymptotic case of N — > oo, having £ = log iV+(M+i<C— 2) log log iV— log log log log N— 
q(N) and /3 = e~T^, and using equations (l53l and d64l . we have 

M-l 



- Define 5 m = {{k,j)\(k,j) G <S m _i, | V^ ra)dm V kJ \ 2 < (3) and 7fc,j( m ) 




Prop 



^|5 m |+2M 2 



m=0 



M-l 




m=0 



M-l 



0(e — m ) + 2M 2 



m=l 




6 pW 



(104) 



r(M)r(ir)' 
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Figure |4| depicts the plots of the required amount of feedback versus the number of users for 
M = 2, K = 1 and M = 4, K = 1, when Algorithm 1 and Algorithm 2 are used. The measure 
for the amount of feedback is defined as the number of real components per user that should 
be sent to the base station. In these curves, the optimum values for the thresholds (t and (3) are 
found by exhaustive search. 




Fig. 4. Amount of feedback 



B. Search Complexity 

Since at the first step of the algorithm, only a fraction of eigenvectors are pre-selected, the size 
of the search space for next steps is decreased from NK to L. As can be observed, at the mth 
step of the algorithm, the base station searches for the dimension with the smallest 7fcj(m — 1) 
among <S m _i, which requires L — m + 1 searches. Therefore, the total number of searches for 
selecting the desired set is equal to ^2^ =1 (L — m + 1) = ML — MiMzli f which is linear in L. 
Again, we can restrict our search space if the modified algorithm stated in the previous section 
is used. 
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As mentioned earlier, the best M eigenvectors for maximizing the sum-rate capacity can be 
found by exhaustive search. In this case, the size of the search space is equal to ( N ^) ■ 

In the asymptotic case of N — > oo, the total number of searches is Q(e p ^) ~ o(N) for 
the proposed algorithm, which is much less than that of exhaustive search (6(iV M )). Therefore, 
using our algorithm the complexity of search at the base station is decreased significantly. 

VI. Conclusion 

In this paper, we have considered a downlink communication system, in which a base station 
equipped with M transmit antennas communicates with N users, each equipped with K receive 
antennas. We have proposed an efficient suboptimum algorithm for selecting a set of users in 
order to maximize the sum-rate throughput of the system, using zero-forcing beam-forming at the 
base station. For the asymptotic case of N — > oo, we have derived the necessary and sufficient 
conditions to achieve the optimum sum-rate capacity, such that liniAr^oo 72-opt — T^Prop = 0. 
We have also investigated the complexity of our scheme in terms of the required amount of 
feedback from the users to the base station, as well as the number of searches needed for selecting 
the coordinates. The proposed algorithm is compared with some other downlink strategies like 
TDMA and Random Selection algorithms. 

Appendix A; Proof of Lemma 3 

In this appendix, we derive the probability density function of 0{i,j) = |T^* imax ^j,max| 2 - For 
simplicity of notation, V i:milx is denoted by <f>v an d Vj jjnax is denoted by 0-. Since <p { and 
<f>j are the eigenvectors of two independent matrices whose entries are independent CA/"(0, 1), it 
follows from [34] that 4>% an d 4>j are independent isotropically distributed unit vectors in C M , 
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with the following probability density function: 

T(M) 

p*(«) =p^(4>) = -^-Wt-i). (105) 

Indeed, this probability density function does not change by multiplying any M x M unitary 
matrix 0, i.e., 

(106) 

Now, define u = 4>*4>j, and let be a unitary matrix whose first row is equal to 0^ We can 
write 

u = 0*0*0^ 

= [io---o]0; 

= 0-(l), (107) 

where </>^ = 0</>j, and </>j(l) is the first element of Since is unitary, </> • and </>^ have the 
same pdf. Hence, the probability density function of 0,(1) is the same as that of </>j(l), and can 
be computed as [34] 

P»(«) =P^(i)(«) = (1 - \u\ 2 ) M -\ (108) 

Using the above equation, the probability density function of 0(i,j) = \u\ 2 will be equal to 

PO(i,j)(z) = P\u\*(z) 

P\u\{y/z) 
2-K^fzj) u {^fz) 

= (M -1)(1- z) M ~ 2 . (109) 
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Appendix B; Proof of (1631) 

Since the selected vectors {V S3 . imax }*_ 1 are nearly orthogonal to each other, they form a basis 
for the sub-space spanned by them. We call this sub-space Vi \. In the following, we denote 
Vt imax , the eigenvector corresponding to the maximum singular value of user k, by (ft k for the 
simplicity of notation. 

Any vector v G C M can be represented as 



i-l 



V = V ± +J2(<l>s j ,v)<f>s j , (HO) 



where i; 1 - is the project of v on the null space of Vi-i, denoted by V { x , and (^(f> s .,vj = 4>* Sj v. 

Defining the event C{ = {0(si, k) < a, ■ ■ ■ , 0(si-x, k) < a} 1 , the conditional probability in 
d63l can be written as 

m = Prob {0(si, fc) < a| &} . (Ill) 
Using d53b. we can write by 

= {l#Al 2 < a >"->l#,-Al a<a }- (112) 
Hence, (II lib can be expressed as 

Kt = Prob{|^0 fc | 2 <a| \cjf si c(> k \ 2 <a,---,\ct>: i _ i ct> k \ 2 <a]. (113) 



Using (111 Ob . we can write fe as 



and <f> as 



i-l 

= (114) 



i-l 

i=i 



7 Recall the definition of a which is jrp-. 
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Hence, \cjf s .cf) k \ 2 can be computed as, 



I0sA' 2 



3=1 



i-1 i-1 



Yl yt ) sv ( t > s ] ) (<f> si ,<f>k) (<t>s P <t> sl 



3=1 1=1 
l¥=3 



Defining 



Ul 



U 2 



U 3 



i-1 



3=1 

i-1 i-1 



k / ■> 



<f> Sj ) < Si , 4> k ) ( <t> Sj ,<f> 8l ), 



3=1 1=1 



we have 



W Si 4> k \ 2 = |«i| 2 + |« 2 | 2 + H 2 + 2^{u lU * 2 } + 2^{u 2 ul} + 2^{u lU * 3 }, 
where ?R.{x} denotes the real part of x. An upper bound for |0*.0 fe | 2 is given by 

\<t>* St <t> k \ 2 < k| 2 + kl 2 + N 2 + 2|«i|(|ti2| + 1« 3 |) + 2|« 2 |N- 

Having the facts that \\<p k || 2 < \\<p k \\ 2 = 1, and ||</>j"J| 2 < ||</> S J 2 = 1, we can write 

M 2 , l«i| 



\<t>:M < 



+ 2- 



<\u 2 \ + \u 3 \) + (\u 2 \ + \u 3 \y 



HtHKW 2 II^IIIKI 

O (fa 4i) + 1\jO WX)(I«2| + |«3|) + (N + N) s 



0{d>i,^ + \u 2 \ + \u 3 \ 



Also, a lower bound for |</>* 4> k \ 2 can be given as 



\4>* Si <t>k\ > \ u i\ ~ 2|ui|(|u 2 | + Kl) - 2|ita| jit 3 j 



> o (fit II^IHKII 2 - 2^0 + |«sl)ll^ 



> </>J) ||^|| 2 ||0i || 2 - 2^0 (0^0i)(|« 2 | + |«s|) - 2|«2||«s| 
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Using (11141) and (I115I) . we have 

i— 1 i— 1 i— 1 



ii0^n 2 = i - \<t>:M + E E ^) O **> > < 122) 



and 

i— 1 i— 1 i— 1 



ii^iii 2 = i - E l^AJ 2 + E E • d23) 



jr = l j=l Z=l 



Conditioned on Cj, and knowing that the set {0 S .}} =1 is e(7V) -orthogonal (or equivalently, Ma- 
orthogonal, i.e., |0g.0 s J 2 < Moc,j,l = 1, • • • from (11171) we conclude the followings: 

|« 2 | < (i-l)VMa, 

|« 3 | < (i- l)(i-2)M« 3/2 , 

||0fcf > l-(i-l)a-(i-l)(i-2)VMa 3/2 , 

||0if > l-(i-l)M«-(i-l)(i-2)M 3 V 2 . (124) 

Therefore, using (11201) . (11211) . and (11241) the upper bound and lower bound for \(p* s .cj) k \ 2 can be 
rewritten as 

\<t>lM <(y/o {<f>i, 0i) + (i - l)VMa + (i - l)(i - 2)M« 3 / 2 j , (125) 

and 



|<^</> fc | 2 > AO (0^, 0^) - 2By O {4>l, 0^) - C, (126) 

where A — (l — (i — l)a — (i — l)(i - 2) V / M« 3/2 ) (l - (i - l)Ma - (i - l)(i - 2)M V / Ma 3/2 ), 
B = (i- l)y/Ma + l)(i - 2)Ma 3 / 2 , and C = 2(i - l) 2 (i - 2)M 3 / 2 a 5 / 2 . 
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Using (fTTTT) . (fT25ll . and (fT26l) we have 
> Prob 



and 



(<^, 0i) + (i - l)VMa + (i - l)(i - 2)M, 



a 



3/2 



< a 



Prob <^ O (<f>l, 0^) < Va - (i - l)VMa + (z - l)(z - 2)Ma 



,3/2 



Prob (<^, 0i) < a - 2(i - l)^ 3 / 2 + 0(a 2 )} 



< Prob | AO 0i) - 25^/(9 0i) - C < a} 



Prob < JO (0^ , 4>jJ < 



B + y/^ 2 + A(C + a) 



(127) 



= Prob | sjo (0^,0^) < >/a + (i - l)VMa + 0(a 3/2 
= Prob{o(0^,0^) <a + 2(z-l)v / Ma 3/2 + 0(a 2 )}. (128) 

Since (fij; and <j> 8 . are the projections of <p k and S . over "P^, a (M — z + 1) -dimensional 

subspace of C A/Xl , tt^tt, and tt? tit, can be considered as uniformly distributed unit vectors in 

ll<M \\4> Si \\ 

Therefore, using Lemma 3, the probability density function for O (4> k , (f) s .) can be given 

as 



Po(*t,+i i )(*) = (M-i){l-z) 



M-i-l 



Having (1 1 29b . and using (11271) and (11281) we can write 

^o+2(j-l) v / Mo 3/2 +0(o 2 ) 



(M-i){l -z) M -'- l dz 



1 — a — Z (i - 1)^/Ma 3/2 + 0(a 2 ) 



M-i 



(M - i)a + 2(M - - l)VMa 3/2 + 0(a 2 ), 



(129) 



(130) 
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and 

v-2(i-l) v / Ma 3/2 +0(a 2 ) 



m > I (M - i)(l - z^-^dz 

-l M-i 



1 



1 - a + 2(i - l)VMa 3/2 + O 



[a 



2\ 



~ (M - ?> - 2(M - - 1) VIa 3/2 + 0(a 2 ). (131) 
From (11301) and (1131b we conclude 

m ~ (M-i)a + 0(a 3/2 ). (132) 

Appendix C; Proof of Lemma 5 

Let us define 

p = Prob{A max (JT A ) >*}, (133) 

where £ = log iV + (M + K — 1) log log AT. Using J23b . the above probability probability can be 
written as 



[i + Q(r 



p ■ - f 1 



r(M)r(x) 

[log AT + (M + if — 1) log log JV^* -2 + O ([log N} M+K - 3 ) 
r(M)T(K)e l °z JV+(M+A--i)iogio g iv 

■ o . „34) 



A^iogA^r(M)r(K) v-Wpog-W] 
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N 



1 - exp (-Np + 0(Np 2 )) 
T 1 

1 — exp 



T(M)T(K) log N 



+ 



1 - 

o 



1 - 



T(M)T(K) log 



+ 



log log A" 
[log 2 

log log A" 
[\ogN] 2 



1 

logN 



(135) 



Appendix D 

We observed that B = TiTi* is an M x M matrix whose diagonal elements behave like 
logN + f(N), where f(N) ~ o(\ogN), and its non-diagonal elements scale as 0(e(N) log A"). 
For simplicity of notation, we define 0(N) = \ogN + f(N) and <p(N) = 0(e(N) log N). 
Let us define A m as a m x m matrix whose diagonal elements scale like 0(N), and, its non- 
diagonal elements scale like (p(N). Hence, all diagonal elements of & can be written as 

det Am -i 
det Am 
It can be easily shown that 



det An = [9(N)] m + 0([9(N)r- 2 ^(N)} 2 ) 

= [\ogN} m + ([log N} m h(N)) , m = 2,---,M. 



(136) 



where h(N) = max ( e(N) ) ~ o(l). Consequently, we can write any diagonal element of 



B as 



IB' 1 } 



[log A^] M_1 + O ([logAA] M - 1 / i (AA)) 
[log N] M + O ([log N] M h(N)) 

[log AT]" 1 + O (^(^[logAT]- 1 ) . 



(137) 
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Appendix E; Proof of Lemma 6 
For the proposed method, we have seen that the achievable sum-rate can be lower-bounded 



as 



ftprop > E W Mbg 1 + 



p 

Tr {[Wl-Cf 1 ] 

> M\ogP - ME H {log(Tr {[Ti*?^]- 1 })} . (138) 



where 7i is the " selection coordinate matrix", defined in ©. 
In [35], it has been shown that 

IN| 2 KH 2 < 5(B), i = l,---,M, (139) 

where bi, i = 1, • • • , M, are the columns of B, a M x M matrix with the orthogonality defect 
8(B), and a ri , i = 1, ■ ■ ■ , M, are the columns of A = (B^ 1 )*. Similarly, we can write 

H&iflKH 2 < 5(A), i = l,---,M. (140) 

Defining B = 7i , and using the above equation, we can write 

M 

Tr ([HHT 1 ) = Eil^H 2 

8(H* 



i=i 

M 



where a i5 is the ith column of 7i*, which is equal to g*.. Having the fact that ||<7 a J| 2 > t (by 
the algorithm), we can rewrite ( 1141ft as 

Tr (j7tH*]~ 1 ) < M5 ^ i *\ (142) 
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Defining X(H) = logTr {[UWy 1 ), Y{H) = log \ '-, Z{U) = log 6(H m ), and F w {.) 



as the CDF of the random variable W, we have 



E{X(H)} < E{Y(H)} 



= \og^ + E{Z(H)} 

= log— + / zfz(-H){z)dz 
1 Jo 

M Z" 00 
= log— + / [l-F z{n) (z)]dz 
1 Jo 

M f°° 

= \og T + j^ [l-F s(w) (e z )]dz. (143) 

It can be easily shown that 5(TL*) = S(^f), where * = • ■ ■ \^m] is the matrix consisting 

of the normalized columns of 7t*, i.e., = , % — 1, • • • , M. Since the rows of Ti are 

\\rC i\\ 

chosen randomly among the pre-selected eigenvectors, and due to the fact that the eigenvalues 
of a zero-mean circularly symmetric Gaussian matrix are independent of their corresponding 
eigenvectors, \I> can be considered as a M x M matrix whose column are M randomly selected 
unit vectors. We have 



6{V) 



det(*)| 2 
1 

rM-1 ' 



(144) 



rL=i n 

where 7, is the square norm of the project of over the sub-space spanned by {^jY j=1 , T>\. 
Now, consider • • • , $if, to be an orthonormal basis for the M-dimensional space, where 
{&j}j =1 are a basis for Vi. Therefore, Vl/j+i can be represented as (V'l.i+i, • • • , V'm+Ij 0, • ' ' > 0), 
where V^i+i I s the project of \l> i+1 over <&j. In [34], the joint probability density function of 
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Using the above equation, the probability density function of 7j = U^+jJI 2 can be written as 

« ( z ) = T ^ z^h - z)™-*- 1 (146) 

W) T(i)T{M-i) 1 > ' K } 

which corresponds to the Beta distribution with parameters (i, M — i). 

Using (11441) . (11461) . and independence of 7$'s [19], we have 

Prob {6(i&) > r} < Prob jmn^ < r~^i X 

AI-l 

= l- n [i-Vif^)] > r >!> ( i4v ) 
i=i 

where / r , a (.) denotes the Incomplete Beta Function, with parameters (r, s). In [36], it has been 
shown that 

/r,,(x)= 1 Z, >\. v( / + /r+i,,-i(s), Vr, S GZ + , (148) 
1 (r + 1)1 (s) 

which incurs that 

I r>s {x) > I r+ i,s-i{x), Vx G [0, 1]. (149) 

Consequently, 

I i)M -i(x) < h,M-l( X ) 

= l-(l-x) M -\ « = !,••• ,Af-l. (150) 



Using (11501) and (I147I) . we can write, 

(M-l) 



Prob > r} < 1 - M - M -y/l/r) . (151) 



DRAFT 



52 



Combining (11431) and (I151I) . we have 

M 



E{X(H)} < log- 



t 



1 — e at-' 



(M-l) 2 



log M + '^ r -^ ( _ ir+1 ^ 



M 



m=l 

(M-l) 2 



e M -*dr 



l0g-+ £ 



m=l 



(M — 1) : 



m 



■1) 



m+l 



M-l 



??? 



M 



log- + (M-l) £ 



(M-l) 2 1 _ 1 _ 



m=l 
(M-l) 2 



< log— + M-l ~ 
t ' m 



m=l 



M 



< log— + (M-l)[21og(M-l) + l]. 



Substituting (11521) into (11381) and having t = log N, we get 

^Prop > Mlog log - M(M - l)[21og(M - 1) + 1]. 



As a result, 



lim — = 1. 



Opt 



Appendix F; Proof of Lemma 7 



Achievability of the maximum multiplexing gain 



(152) 



(153) 



(154) 



Using (II 38b . the multiplexing gain achieved by the proposed method, denoted by rp rop , can 
be lower-bounded as 

MlogP - ME H {log (Tr {[7CH]- 1 }) } 



?~Prop > lim 

P^oo 



M - M lim 

P^oo 



logP 

E^jlogTrlpttTTr 1 }} 
logP 



(155) 
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Following the proof of Lemma 6 in Appendix E, and using equations (1143b . and (11521) . and 
the union bound for the probability, we have 

(Af-1) 2 ~ 



1 — e M -i 



dr 



< logy + (M-l)[2bg(M-l) + l]^J, (156) 

where L is the number of preselected eigenvectors in the first step of Algorithm 1. Since L < 
NK, we have E w {logTr |[7i?i*] _1 }} < oo, the second term in (11551) approaches zero, and 
as a result r Prop > M. 

For the optimum strategy, the sum-rate can be upper-bounded as [37], 

^opt < ME||„||_ |log (l + ^||/f||L x ) } , (157) 

where ||i?||m ax is the maximum Frobinous norm of all channel matrices. This random variable 
can be considered as the maximum of iV x 2 (2Mi^) random variables which has the pdf of the 
form 

PlIHifcJ*) = N X % (x,MKf-\ (158) 

where j(x, MK) = t{mk) U ^ u - usm § <E3> and (11581) . we can write the upper bound 

for the sum-rate as 

f°° P r MK-l /_ \ 

^opt < Mj( \og{l + -x)N X rj ^P X) 1 (x,MK) N ^dx. (159) 
Thus, using the above equation, we have 
r 0pt = lim 

P^oo log P 

M log P + /„" M \og(^)N X f ^0 ) Xj 7 (x, MK) N ~ x dx 
~ logP 

= M. (160) 
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Since for any values of P and N, TZo pt (P, N) is the maximum achievable sum-rate , r Qpt will 
be the maximum achievable multiplexing gain in MIMO-BC. Hence, using the above equation 
and having the fact that r Prop > M, we conclude r 0pt = r Plop = M Therefore, the proposed 
method achieves the maximum multiplexing gain in MIMO-BC. 

Achievability of the optimum multiuser diversity gain 

In the proof of theorem 2, we observed that the sum-rate achieved by the proposed strategy, 
as well as the optimum one, scales like M log (■£ log N). Hence, using (11621) the multiuser 
diversity gain for the optimal scheme, denoted by d Qpt is equal to 

d Qpt = lim 



N- 



*oc r Qpt log log N 

Mlogf^logiV 

lim 

iV-»oc M log log N 

1. (161) 



and for the proposed method, 



; t ^-Prop 
"Prop = lim : : 7J 

n^oo r Prop log log N 



lim 



Vlog( ^ log N 



N^oo M log log N 
= 1. (162) 

Therefore, the proposed method achieves the maximum multiuser diversity gain in MIMO-BC. 
This, completes the proof of Lemma 7. 

Appendix G; Multiplexing Gain in Random Selection Method 

In this appendix, we prove that the Random selection strategy achieves the maximum multiplex- 
ing gain, i.e., r RS = M. For this purpose, we consider the the precoding scheme of zero-forcing 
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beam-forming. We assume that the coordinates are chosen randomly among the eigenvectors 
corresponding to the maximum singular value of each user's channel matrix. Therefore, similar 
to (1155b . we have 

E^jlogTrj^W]- 1 }} 



rRS BF > M — M lim 

' P— >oo 



lo K P 



(163) 



where H = [fl£ imax | t/f 2 , max | • • • | gJ M>max ] , and the users s h ■ ■ ■ , s M are selected randomly. 
Defining B = 7i _1 , similar to (I141I) . we can write 



M 



Tr {{nnT 1 } < 



5{W 



i=i 



\CLi 



(164) 



where cij is the iih. column of 7i*, which is equal to g Si . Noting that ||</ Si || 2 = A max (i3" s J, we 
have 



M 



6{H*) 



< 



M 

E 

i=l 



M5{W 



(165) 



Using (11431) . (I152I) . and (11651) we can write 



M 



E^jlogTrl^]- 1 }} < MMEl^TF 



,i=i 



log M + E {\og5{H*)} +E /log Jjf 



< log M + (M — 1) [2 log(M — 1) + 1] + log 



ME 



< M[21og(M- 1) + 1] +log 



x 



MK-l 



X 



exp(— x) 



, Y{MK) 
M[2 log(M - 1) + 1] - \og(MK - 1). 



dx 



(166) 



Using (11631) and (11661) . and noting that r^| BF < r^s < M, we conclude r^s = M. 
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